A Modular On-line Profit Sharing Approach in Multiagent Domains
Abstract
Coordinating the behaviors of agents through learning is a challenging problem in multi-agent domains. Because of its complexity, recent work has focused on how coordinated strategies can be learned. Here we are interested in using reinforcement learning techniques to learn the coordinated actions of a group of agents without requiring explicit communication among them. However, traditional reinforcement learning methods assume that the environment can be modeled as a Markov decision process, an assumption that usually does not hold when multiple agents coexist in the same environment. Moreover, to coordinate each agent's behavior effectively so as to achieve the goal, it is necessary to augment the state of each agent with information about the other agents. As the number of agents in a multi-agent environment increases, however, the state space of each agent grows exponentially, which causes a combinatorial explosion. Profit sharing is a reinforcement learning method that allows agents to learn effective behaviors from their experiences even in non-Markovian environments. In this paper, to remedy the drawback of the original profit sharing approach, which needs much memory to store each state-action pair during the learning process, we first present an on-line rational profit sharing algorithm. We then integrate the advantages of a modular learning architecture with the on-line rational profit sharing algorithm and propose a new modular reinforcement learning model. The effectiveness of the technique is demonstrated using the pursuit problem.

Keywords: multi-agent learning; reinforcement learning; rational profit sharing; modular architecture.
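To make the memory cost mentioned above concrete, here is a minimal sketch of the basic episodic form of profit sharing, not the on-line or modular variant proposed in the paper. It assumes a geometrically decreasing credit function. The function name `profit_sharing_update`, the `decay` parameter and its value, and the toy states and actions are all hypothetical choices made for illustration.

```python
from collections import defaultdict


def profit_sharing_update(weights, episode, reward, decay=0.25):
    """Distribute an episode's reward over the rules that were fired.

    weights: mapping from (state, action) to accumulated credit.
    episode: list of (state, action) pairs in the order they were executed.
    reward:  scalar reward received at the end of the episode.
    decay:   assumed common ratio of the geometric credit function.
    """
    credit = reward
    # Walk backwards from the rewarded step so that rules closer to the
    # reward receive more credit than earlier ones.
    for state, action in reversed(episode):
        weights[(state, action)] += credit
        credit *= decay
    return weights


# Toy episode: the whole trace is buffered until the reward arrives,
# which is the memory cost the abstract refers to.
weights = defaultdict(float)
episode = [("s0", "right"), ("s1", "right"), ("s2", "up")]
profit_sharing_update(weights, episode, reward=1.0, decay=0.25)
```

In this sketch the full episode trace is kept until the reward is observed. This is the storage requirement that an on-line formulation aims to reduce.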